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Abstract. We prove a dyadic representation theorem for bi-parameter singular 
integrals. That is, we represent certain bi-parameter operators as rapidly decay- 
ing averages of what we call bi-parameter shifts. A new version of the product 
space Tl theorem is established as a consequence. 

1. Introduction 

We study certain bi-parameter singular integrals T acting on some class of 
functions with product domain M"+'" = x M™. Our aim is to prove a rep- 
resentation theorem for them as an average of bi-parameter shifts S: 

(T/, g) = CtE^„E^^ J2 2— ^^(^-'-)^/22— 9)- 

Here the average is taken over all the dyadic grids Vn in M" (parametrized by 
the random parameter Wn) and all the dyadic grids Vm in M™ (parametrized by 
the random parameter Wm)- An exact formulation of everything is given after 
the introduction. Such a representation theorem exists for ordinary Calderon- 
Zygmund operators, and this was proven by Hytonen [TJ in connection with the 
proof of the A2 conjecture for general singular integrals. 

In the one-parameter case such general representation theorems have already 
been utilized several times after HZl. The simplified proof of the A2 conjecture 
by Hytonen, Perez, Treil and Volberg [TO] offered among other things a bit easier 
formulation of the representation theorem. In [8] the author together with Hyto- 
nen, Lacey, Orponen, Reguera, Sawyer and Uriarte-Tuero used the representa- 
tion theorem to study sharp weak and strong type weighted bounds for maximal 
truncations T^. Modifying the metric randomization by Hytonen and the author 
m these representation theorems were lifted to the generality of metric spaces by 
Nazarov, Reznikov and Volberg [14J. Several other applications in the weighted 
context also already exist. 
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The reason why the representation theorem is so useful in the one-parameter 
case is that it can be used to reduce problems considering a general singular in- 
tegral T into purely dyadic problems considering shifts only Because of this, 
there is no particular reason why the applications should be limited to weighted 
questions. This just happens to be the case, since the representation theorem 
was originally developed for this purpose and is still very new a result. This is 
motivation enough for us to develop the analogous theory in the bi-parameter 
case. It would, of course, be interesting to study sharp weighted theory in the 
bi-parameter setting. Our theorem might be useful for this, however, it is a very 
difficult problem. 

Regarding multi-parameter singular integrals, and multi-parameter harmonic 
analysis in general, there is a very large existing theory. After the classical Tl 
and Th type theory by David and Joume [2J and David, Journe and Semmes [H, 
the first Tl type theorem for product spaces was proved by Journe [llj. Regard- 
ing other classical theory, we only mention the work of Chang and Fefferman 111, 
Fefferman [4J and Fefferman and Stein [5J. These three concern singular integrals 
and various spaces, like the BMO, on the product setting. There is a wide body of 
more recent developments of which we here only mention the papers by Fergu- 
son and Lacey ||6l, Lacey and Metcalfe ||T2| and Muscalu, Pipher, Tao and Thiele 
||T3| . These have to do with various multi-parameter paraproducts and character- 
izations for some product spaces. Some bi-parameter paraproducts appear also 
in our proof, and the product BMO space is thus important for us. 

The classical multi-parameter singular integral theory of Journe [11] involves 
formulations written in the language of vector-valued Calderon-Zygmund the- 
ory. Very recently Pott and Villarroya [16] formulated and proved a new type of 
Tl theorem for product spaces. There such vector-valued formulations are re- 
placed by several new mixed type conditions. Here we define our bi-parameter 
operators inspired by ||T6| . The conditions we use are not exactly the same. We, 
for example, do not work with smooth testing conditions. Establishing the cor- 
rect shift structure is our primary task. However, we do get, as a by product, a 
pretty nice form of the product space Tl theorem. 

In this paper we bring the superbly useful machinery of non-homogeneous 
analysis pioneered by Nazarov, Treil and Volberg (see for example IflSl ') to the 
context of bi-parameter theory. The use of non-homogeneous analysis gives addi- 
tional decay for certain matrix elements involved in the expansion of (T/, g) . Just 
like in Hytonen's proof of the representation theorem for one-parameter singular 
integrals, the proof is a Tl style proof with ingredients from non-homogeneous 
analysis. In our case, we have to deal with the much added complexity of the 
bi-parameter situation. Indeed, there are more cases than in the one-parameter 
setting, and many of these are interesting mixed type phenomena. The non- 
homogeneous analysis makes this splitting into cases nicely transparent getting 
rid of rare geometric complications. 
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2. Definitions, strategy and the main result 

Structural assumptions. Let us formulate the Calderon-Zygmund structure of 
our operators. The basic assumption is that if / = fi®f2 (meaning f{x) — 
fi{xi)f2{x2) for X = {xi,X2)) and g = g^^g2 with /i, : R" ^ C, /s, 92: ^ C, 
spt fi n spt g\ — ^ and spt /2 n spt g2 — 0, then we have the kernel representation 

{Tf,9)=l I K{x,y)f{y)g{x)dxdy. 

The kernel X: x R"+™) \ {{x, y) e R"+"* x : xi = yi or X2 = ^2} ^ C 

is assumed to satisfy the size condition 

\K{x,y)\<C- 

and the Holder conditions 
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Furthermore, we assume the mixed Holder and size conditions 
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whenever |xi — x'J < \xi — yi|/2, 

|X(x,y)-X(x,(yl,y2))|<C 
whenever \yi - y[\ < \xi - yi\/2, 

\Kix,y)-Kiix,,x',),y)\<C 
whenever |x2 — < \x2 — y2|/2, and 

\K{x,y)-K{x,{y,,y',))\<C 

whenever |y2 — I/2I ^ |2^2 — I/2I/2. We use, for minor convenience, £°° metrics on 
W and W^. 

We also need some Calderon-Zygmund structure on and R"* separately. If 
f — fi<S> f2 and g — gi<S> 92 with spt /i n spt gi — 0, then we assume the kernel 
representation 

{Tf,g)= / Kf^^g^{xi,yi)fi{yi)gi{xi)dxidy^. 

The kernel Kf^^g^ : {W x M") \ {(xi, 1/1) e x : xi = t/i} is assumed to satisfy 
the size condition 

\Kf,,g2{Xl,yi) \ < C if 2, 92)7—^-7^ 

\xi yi\ 

and the Holder conditions 

1^/2,52(2^1,1/1) - Kf^,g^{x[,yi)\ < C{f2,g2) 
whenever |xi — xi| < \xi — yi\/2, and 

1^/2,32(2^1,^1) - Kf2,92ixi,y'i)\ < C{f2,g2) 



\xi 




\xi - 


-y,\n+S 


\yi 


-y[\' 


\xi - 


-y^\n+S 



whenever \yi — y[\ < \xi — yi\/2. Let \A\ denote the Lebesgue measure of a set A 
and xa be the characteristic function of A. We need the above representations and 
some control for C(/2, 92) only in the diagonal in the following sense. For every 
cube V c R"* we assume that there holds C{xv, Xv) + Cixv, uy) + C{uv, Xv) < 
C\V\, whenever is such a function that sptuy C V, \uv\ < 1 and J uy = 0. 
Functions uy are called ^-adapted with zero-mean (so ^-adapted means just the 
first two conditions on the support and size). We also assume the analogous 
representation and properties with a kernel in the case spt /2 n spt g2 — 0. 
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Boundedness and cancellation assumptions. Dejfine the partial adjoint Ti of T 
by setting 

{Tiih ® f2),gi 92) = {T{gi ® /2), /i 92). 

We assume that Tl, T*l, Ti(l) and T*(l) belong to the product BMO on x W^. 
We recall the definition of this space later in this section. 

We assume that | {T{xk '^Xv),Xk'^Xv)\ < C|/s:| |\/ | for every cube K gW^ and 
V C M™. This is the weak boundedness property for T. 

We also assume the following diagonal BMO conditions: for every cube K c 
M" and V C M."^ and for every zero-mean functions qk and by which are K and V 
adapted respectively (one has spta^^ (Z K,\aK\ < 1 and J ax — 0, and similarly 
for by): 

(i) \{T{aK^Xv),XK^Xv)\<C\K\\Vl 

(ii) \{T{xK®Xv),aK®Xv)\<C\K\\V\, 

(iii) \{T{XK ® bv),XK ® XWI < C^li^llV^I, 

(iv) \{T{xK^Xv),XK^bv)\<C\K\\V\. 



Haar functions. Let /?./ be a normalized Haar function related to J G Vn, 
where Vn is a dyadic grid on M". With this we mean that hj, I = Ii x ■ ■ ■ x In, is 
one of the 2" functions h],r) = (771, ... , t^^) g {0, 1}", defined by 

h] = h]l(^---(^hl, 

where /i?, = \Ii\-^/^xii and /i}. = \Ii\'^^^{xii,i - Xu^r) for every i = 1, . . . , n. Here 
/j i and Jj are the left and right halves of the interval respectively. If 77 7^ 
the Haar function is cancellative: J hj = 0. All the cancellative Haar functions 
form an orthonormal basis of L^(M"). If a G L^(M") we may thus write a — 
Yli&Vn S??e{o i}"\{o}('^' hl[)hPj. However, we suppress the finite 77 summation and 
just write a — ^j(a, hi)hi. Given a dyadic grid Vm on R"* and a cube J G V^, 
we denote a normalized Haar function on Jbyuj. 



Product BMO on M" x M™. Let us be given a dyadic grid Vn in M" and a dyadic 
grid V^ in R"*. We define the square function 



>Xk®Xv 



KeV„ VET) 

n 



\K\\V\ 



1/2 



Then the product Hardy space H^^j,^ (R" x R"*) consists of the locally integrable 
functions / with ||/||^i (r^xr-) = \\Sv„v^f\\i < 00. The dual of this space is 
the product BMO space W^Ov^v^^"" x R"^). 

For us, the condition that 6 G {Tl, T*l, Ti(l), Ti*(l)} is in the product BMO is 
defined to mean that ||^||bmOx)^d^(R"xR'") ^ with every dyadic grid Vn in R" 
and every dyadic grid V^ in : 



pm 
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Bi-parameter shifts. A bi-parameter shift on R" x R"* is tied to a dyadic grid X>„ 
on M", a dyadic grid on R"" and non-negative integers ii,i2,ji,j2- Such an 
operator is denoted by -S*^*^^^ and is of the form 

qhi2jlj2 f _ \ \ A'tl't2jlj2 f 

•^VnVm J ~ ^KV /' 

m 

where 

j^n2jij2f^ J2 J2 ai,i,Kj,j2v{f,hi,^uj,)hi,^uj, 

luhCK Ji,J2<ZV 

e{h)=2-ne{K) e{ji)=2-ne{v) 

l{l2)=2-'2i{K) e{J2)=2-ni{V) 



with 



\0'hl2KJlJ2V 



< 





1/2 




1/2 


\-h\ 


1/2 


-hi 


|l/2 




\K\ 




\v 





Here, of course, Ii,l2 e T>n and Ji, J2 e T>m, and £{I) denotes the side length of a 
cube /. It is also required that all the subshifts 

map L2(R" X R™) L'^{W x R™) with norm at most one. If all of the Haar 
functions hi^,hi^,uj^,uj,^ appearing are cancellative, the shift is called cancella- 
tive. Otherwise, it is called non-cancellative. The last requirement concerning 
the boundedness of all of the subshifts follows from the other conditions for 
cancellative shifts. 

In practice, it is useful to observe that a bi-parameter shift S of type (ii, ^2, ^1,^2) 
related to some dyadic grids is simply of the form 

Ky Ky \^^^\ JkxV 

Ks{x,y)f{y)dy, 



where first of all sptf^AKv^ C {K x V) x {K x V) and y)\ < 1. Moreover, 

Kaj^y is constant with respect to x on dyadic rectangles I x J c K xV for which 
£(/) < 2-'H{K) and i{J) < 2-^H{V), and X^^^ is constant with respect to y on 
dyadic rectangles IxJcKxVior which £(/) < 2-'H{K) and t{J) < 2-^H{V). 
Note also that clearly 
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2.1. Random dyadic grids and the basic averaging formula. Let Wn = (tL'^)igz 

and Wm = {wC)j&, where < e {0, 1}" and < G {0, 1}™. Let P° and be 
the standard dyadic grids on M" and respectively. In M" we define the new 
dyadic grid I?„ = {/ + Y.i: 2-^<e(i) 2"'< : ^ e } = {/ + m„ : / G P° }, where we 
simply have defined / + w„ := J + X]r2-'<^(/) The dyadic grid Vm in 

is similarly defined. There is a natural product probability structure on ({0, 1}")^ 
and ({0, 1}™)^. So we have independent random dyadic grids Vn and Dm in M" 
and respectively. Even iin = m we need two independent grids. 

A cube I eVn is called bad if there exists I e so that £(/) > 2''£(J) and 
d{I,di) < 2£{I)^-e{iy-^^\ Here 7„ = (5/(2n + 25), where 6 > appears in the 
kernel estimates. One notes that TTg^od := ^wA^ + Wn is good) is independent of 
/ G "D^. The parameter r is a fixed constant so that TTg^^^, TTg^^^ > 0. Furthermore, 
it is important to note that for a fixed / G the set I + Wn depends on w,^ with 
2"* < £(/), while the goodness of / + Wn depends on w'!^ with 2"* > £{I). In 
particular, these notions are independent. Analogous definitions and remarks 
related to Vm hold. 

We prove the basic averaging formula of Hytonen [7] but in the bi-parameter 
setting. This is the only part of the proof where probabilistic arguments are 
needed, and here independence plays a big role, even more so in the bi-parameter 
setting. We note that the functions / and g in this paper are always taken from 
some particularly nice dense subset of functions. 

2.1. Proposition. There holds 

{Tf,g) = CE Xgood{smaUer{Ii, l2))Xgood{smaller{Ji, J2)) 

m 

{T{hj^ (g)Uj-,),hj^ ®uj.,){f, hi^ (g)Uj^){g,hi^ ® wja), 
where E = E^„E^„ and C = l/(7r;„,7r™ ,). 

2.2. Remark. Here all the appearing Haar functions are, of course, cancellative and 
we recall that the finite summations over the 2" — 1 or 2™ — 1 different cancellative 
Haar functions per cube are simply suppressed from the notation. 

Proof. Define (/, hi)i{y) = J f{x, y)hi{x) dx, y G M™. We may write 

so that by independence 

{Tf,g) = E^^ Y {T{h ® (/, hi^+^Ji),g) 

= -^^E^n Yl Xgoodih + Wn){T{hi^+^^ ® {f,hi,+n,Ji),g)- 
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After expanding g similarly as / above, one sees that this equals 

-^E^n Yl ^good(^i + '^n){T{hi^+wu ® if, hi^+wji), hi^+w„ ® {g, hl2+w„)l) 

= -^E^„ Yl Xgood(A + Wn){T{hi^+yj^ ^ (/, hi,+^Ji), hi^+yj„ ^ {g, hj^+^Ji) 
^good h,i2evo 
t{h)<t{h) 

e{h)>e{i2) 

Here we again used independence in the latter sununation. Comparing to the 
trivial representation 

we conclude that 

^(/l)<£(/2) 
^(/l)<^(/2) 

First expanding g and proceeding like above one gets the symmetric formula 

^(/2)<^(/l) 

e{i2)<e{ii) 

Splitting the trivial representation in to these two parts allows us to conclude that 

{Tf, g) = E Xgood (smaller (/i, l2)){T{hi, (/, hi,)i), hi, {g, hi,)i). 

We now expand on M™. One may write 
so that 

JieVm 



REPRESENTATION OF BI-PARAMETER SINGULAR INTEGRALS BY DYADIC OPERATORS 



9 



We may then follow the recipe from above: insert this to the above formula for 
{Tf,g), add goodness to Ji by independence, expand hi^ (S> {g^hi^)]^, split the 
summation to l{Ji) < i{J2) and i{Ji) > ^(^2)/ remove the goodness from Ji in 
the latter summation by independence and, finally, compare to the appropriate 
trivial identity. One also does the symmetric thing, where one first expands hj^ <S> 
{g, hi^)i and adds the goodness to J2. Combining these gives the claim of the 
proposition. □ 

2.3. Remark. One may also use full expansions like / = '^Zi^&v^ I^Jigx>„(/' ® 
uj^ ) hj^ (g) M in the beginning of the proof. Following the usual trickery this leads 
to the formula 

(T/,^) = ^E E Xgood(smaller(/i,/2)) 

\ood I^J^&Vn Jl,J2eVm 

{T{hi^ (g) ujj, (g) uj.^){f, hi^ (g) uj^){g, hj.^ (g uj^). 

Here it may at first seem that there is no longer enough independence to add the 
goodness to Ji. However, one may simply write the summation as 

XI Yl Xgood(smaller(/i, /2))(T(/t/, ® ujj,gi^){f, hj^ ® mjJ, 

m 

where one realizes that 

gi2 = Y {9: (g uj^)hi^ (g uj^ = hi^ (g {g, hi^)i 

does not depend on Wm- Then one may add the goodness to Ji using indepen- 
dence and repeat the basic recipe to get the proposition. 

Strategy and formulation of the main theorem. We fix the random variables w„ 
and Wm which fixes the dyadic grids Vn and Dm respectively. Then we study the 
summation 

5Z X] {T{hi^(guj^),h:^(guj^){f,hi^(guj^){g,hi^(guj^). 

i{h)<t{l2) l{Jl)<t{J2) 

I\ good Ji good 

We more often than not suppress from the notation the important fact that /i and 
Ji are good. Then we perform the splitting 

i{h)<l{l2) i{h)<l{h) hCh h=l2 i{h)<e(h) 

(i(/l,/2)>f(/l)T"f(/2)l-T" d(/l,/2)<^(/l)T"^(/2)l-T" 

7in/2=0 

and similarly for the summation over the grid T>^. Here ^(^4, B) denotes the 
distance of the sets A and B (recall that we use the metric). The first sum is 
the separated sum, then we have the inside sum, the equal sum and the nearby 
sum. The summation over both the grids is split in to various types which also 
includes several mixed types. The list is: separated /separated, separated /inside. 
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separated /equal, separated /nearby, inside /inside, inside /equal, inside /nearby, 
equal/equal, equal/nearby, nearby /nearby and some symmetric mixed sums. It 
seems reasonable to deal with these separately. 

Note that actually the mixed sums where < i{l2) and i{Ji) > ^(-^2) or 
£{Ii) > 1(12) and i{Ji) < ^(^2) are not completely symmetrical to this case. How- 
ever, the relevant difference is only in the full paraproduct that appears in the 
corresponding inside /inside part. There one gets a bit different paraproducts, 
which are related to the assumptions that Ti(l) and T^{1) belong to the product 
BMO of R" X M™. We comment more on this on Remark [72l 

The goal is to represent all of these different parts as a sum of shifts with a 
good decay factor in front. Combining all these cases together leads to our main 
theorem: 

2.4. Theorem. For a bi-parameter singular integral operator T as defined above, there 
holds for some bi-parameter shifts 5*^^^^^ that 

where non-cancellative shifts may only appear if [11,12) = (0, 0) or (ji, J2) = (0, 0). 

2.5. Corollary. A bi-parameter singular integral T as defined above is L"^ bounded. 

We note that all of the appearing non-cancellative shifts will have a certain 
paraproduct structure, and this structure is explicit in the proof. For example in 
ilSH , where the one-parameter representation theorem is applied, it is important 
to know the explicit structure of the non-cancellative shifts. 

The rest of the paper is dedicated to the piece by piece proof of this theorem. 
We use X < Y io mean X < CY for some constant C and X ~ F to mean 
^ ^ ^ ^- Of course, we cannot absorb just any constants, but only ones 
that depend on the dimensions or the various constants from the assumptions 
concerning T. 



3. Separated /SEPARATED 

Let hV I2 = nKev,.,KDhuh ^ ^^^d ^1 V J2 = r\vGV^,VDJ^uJ2 ^- separa- 
tion conditions together with goodness imply V hY''^" ^ d{Ii, I2) and 
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Let us write 

E E 

<i(/l,/2)>^(/l)T"^(/2)l-T" d{Jl,J2)>e{Jlprne(J2y-lm 

= EEE E E 

i2>lil>i2 KeVn d(/l,/2)>^(/l)T"^(/2)l-T" d{Jl ,J2)>e{Jl)^ra £(J2y—fm 

j2>ljl>j2VeVm hVl2=K JiVJ2=V 

e{h)=2-ne{K),i{l2)=2-i2l(K)e{Ji)=2-n£{V),e{J2)=2-J2£{v) 

3.1. Lemma. For Ji, J2, Ji, J2 in the above summation, we have the estimate 

\{T{hi^®uj^),hi^ ®u,j.,)\ 

\K\ \V\ \i{K)J \i{V)) 

Proof. Given a cube / we denote by c/ its center. We may write 

{T{hi, ®uj^),hi^®uj^) 



K{x,y)hi^{yi)uj^{y2)hi^{xi)uj2{x2)dxdy, 

IlXJi -'/2XJ2 

where we may, using cancellation, replace K{x, y) by 

K{x,y) - K{x, (1/1, cjj) - K{x, (c/,,1/2)) + K{x, (c/,,cjj). 

Since \yi - c/J < i{h)/2 < \(i{hyH{hY-^- < d{h,h)/2 < \xi - ci,\/2 and 
similarly ||/2 — cj-^ | < |a:;2 — cj^\/2, we have 

\K{x,y)-K{x, (1/1, cjj) - K{x, {ci,,y2)) + K{x, (c/i,cjj)| 
< 





5 


\y2 - Cj 




\5 


\xi - C/J' 


n+S 


\X2 - CjJ 


\m+d 



< iihfdih, /2)-"-'£( Jl)'rf( Ji, J2)-'"-^ 

< £(/i)'^[£(/i)^"^(;r)i-''"]-"-^£(Ji)^[£(Ji)^'"£(\/)^-^-]-'"-'^ 

= i{hY/H{K)-^/^\K\-H{Jif^£{V)-^/^\V\-\ 

Here we used i{Ky~^" < d{Ii, I2) and 7^77, + 7^^ = 6/2 (and the analogous 

estimates involving Ji, J2, and m). Recalling the normalization of the Haar 
functions and the fact that /£{K) = 2'^^ and £( Ji) /£{V) = 2-^i completes the 
proof. □ 

We write 
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Dejfine 

0'iii2KJiJ2V — (-72-n^/22-ii'5/2 
if all the various goodness and separation conditions appearing in the summa- 
tions are satisfied, and otherwise set 0/1/3^^1^2^^ — 0- This enables us to write 

e{h)<e{i2) e{Ji)<e{J2) 

(/, /i/i ®uj^){g,hi^®uj^) 

in the form 

i2>l ii>i2 Js", V 

J2>1 J1>J2 

where 

^m^jU2 J ^ ^ ^ ai^i^KJiJ2v{f: hi^ (8) iiji)/i/2 (g) uj^ 

h,l2CK Ji,J2CV 

e{h)=2-he{K) i(Ji)=2-he{v) 
l{l2)=2-'2i{K) e{J2)=2-ni(y) 



with 



\0-hl2KJiJ2V 



< 







h\ 


|l/2 


\Jl\ 


|1/2| 


1^2! 


|l/2 




\K 


1 1^1 



The corresponding bi-parameter shift with indices ii, i2, ii, J2 is by definition 

giii2jij2 J _ ^ ^ A^*^''^''^ / 
K,V 

4. Separated/ INSIDE 

As Ji C J2, there is a child J2,i of J2 such that Ji c J2,i. We decompose 

(T(/i/i (g) MjJ, /i/2 ® uj^) = {T{hi^ (g) MjJ, /1/2 (g) SJ1J2) 

+ {uj2).Ji{Tihj^ (g)ujj,hj., (g) 1), 

where sj^j^ = Xjp,Juj^ - {uj^)j^J. The relevant properties of sj^j^ are \sj^j^\ < 
2| J2 1 "^/^ and spts C J21. 



We write 



E E 



e(h)<eii2) J1CJ2 

d(/l,-f2)>^(/l)^"^(/2)l-^" 



EEEEE E E 

i2>lil>i2 jl>l KeT>„ J2eVm d(/l,/2)>^(/l)T"^{/2)l-'l'" J1CJ2 

hvi2=K eiJi)=2-ne(j2) 
e{h)=2-ne(K),e{i2)=2-i2e(K) 
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4.1. Lemma. For Ii, I2, Ji, J2 in the above summation, we have the estimate 

\{T{hi^ (g) MjJ, (g) sjij2)| 



< 



\i(K)J \e(j,)) 



\K\ \J2\'/^\i{K)J \i{J2) 



Proof. There is good separation by the goodness of Ji if i{Ji) < 2~''£( J2). Indeed, 
in this case there holds d(Ji,J|^i) > 2£(Ji)t-£(J2,i)^-T"' > £(Ji)^™£(J2)^-^'". Then 
we may write 

(T(/i/j (g) uj^),hi2 (g SJ1J2) 



/ / K{x,y)hi^{yi)uj^{y2)hi^{xi)sj^j^{x2)dxdy, 



and replace K{x,y) by K{x,y) - K{x, {yi,cjj) - K{x, (0/1,1/2)) + K{x, {ci^cjj) 
using the cancellation of uj^ and hi^ . We may utilize the kernel estimates to get 

\K{x,y) - K{x, {yi,cjJ)-K{x, (0/^,1/2)) + K{x, {cj^,cjj)\ 

<e{hY/'e{K)-'/'\K\-H{j,) 



k2-cjj™+'5' 



This yields 

\{T{hi^ uj^),hi^ (g SJ1J2) 



< 



\K\ U{K)J IJ2IV2 ^ Jj^^ |^2-cjJ-+5^ 
where 

dx2 ^ f dX2 



\rr„ \m+S ~ / , , , |t-„ r- ^ \m+S 

Therefore, we have 

\{T{hi^ ®uj^),hj^ (g sjij2)| 

< i/ir/V2r/v^(/i) y/^iJii^/7^(ji)y/^ 
\K\ \(i{K)) wA(i{j2)) ■ 

We still need to deal with the case 2-''£( J2) < ^( Ji)(< ^( ^2))- This time we split 

{T{hi^ (g) xijj, /i/j (g SJ1J2) = (T(/i7i <H) wji), <g (X3J1SJ1J2)) 

+ (T(/l7i ® Uj^), hi^ (g (X(3Ji)cSjiJ2))- 
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We have that {T{hi^ (g) ujj, hr^ (g) (x3JiSjiJ2)) equals 

/ / [K{x,y) - K{x,{ci^,y2))]hi^{yi)uj^{y2)hi^{xi)sj^j^{x2)dxdy 

JhxJl J l2X{3Jl\J2,l) 

SO we can estimate using the mixed Holder and size estimate that 

\{T{hi^ (g)ujj,hi^ (g) (x3Ji-sjiJ2))| 



< 



< 



|/l|V2|J,|l/2 






K£{K) 


|/l|V2|/,|l/2 






Ki{K) 






\K\ 


Kl{K) 



5/2 



In the term {T{hi^ ® mjJ, hj^ (g) (x(3Ji)='SJiJ2)) we have good separation every- 
where, so the Holder estimate for K yields 



\{T{hi,^ujJ,hi^ ® (X(3Ji)-sji./2))| 

\K\ \e{K)J 



< 



< 



\K\ \eiK)J 
\K\ \e{K)J 



Jl|V^/f(Jl)y/2 



□ 



The above lemma enables us to write 



5Z (^(^^1 ® ^-^i)' ® sJiJ2)(/, /i/i <8) uj^){g, hi^ (g) lijj) 

d{h,i2)>e{hp"eihV-~'r^ 
in the form 

i2>l ii>i2 ii>l 

Next, we deal with the series with the term {uj.-,)j-^ {T{hi^ ® ujj, hj,^ (g) 1). This 
will yield shifts of the type (ii, ^2, 0, 0) which are non-cancellative (their R™ parts 
are paraproducts in a certain sense). As these shifts will be non-cancellative, we 
will also have to worry about their boundedness properties. 
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Write 

i'^J2)ji{T{hi, ® MjJ, hi, ® hi^ ® uj^){g, hi, ® uj,) 



J1CJ2 



2Z ( ^(9, hi, <8) uj,)uj,\ {T(hi, (8) wjj, hi, hi, uj^) 
hi,)i)v{T{hi, (8) uv), hi, (g) /i/, (g) i^y). 



The summands can further be written in the form 

\V\-^^^T{hi, (g) Uv), hi, ® hi, ^ uv)hi, ® 5), 

where Uy = \V\~^^'^xv- Written in this way it is evident that we will have the 
required shift structure of the type (ii, ^2, 0, 0). 

4.2. Lemma. The correct normalization 



IJJ 1/2 1/11/2 .£(J ).5/2 
\{T{hi,^Uy),hi,®l)\< ^ ^1 I ^1 (^) \V 



1 1/2 



holds. 

Proof. Let us first split 

{T{hi, (8) Uv), hi, (8) 1) = {T{hi, (g) -Uy), hi, (g) xay) + (^(^/i ® i^y), <H) X(3y)c)- 
We have 

\{T{hi,(^uv),hi,^X3v)\<\V\-'^^ J2 [\{T{hi,(^Xv'),hi,^X3V\v')\ 

v'ech{v) 

+ \{T{hi,^Xv'),hi,^Xv')\ 

For the first time, we use the kernel representations in M" to write {T{hi, 1 
Xv), hi, (g) xv) in the form 

/ / [Kxv',xvX^i,yi) - Kxv',xv'(^^^^h)]hh{yi)hi,{xi)dxidyi. 
This gives that 



1^1 


1/2 1 


h 


1/2 




K 





I{K). 



< 





1/2 1 


h 


1/2 




K 





I{K)J 

Notice that by the mixed Holder and size estimates for K we have the same 
bound also for the term | {T{hi, ® xv) , hi, ® X3v\v') \, and so there holds 

mhi, ® uv), hi, xawi < (|l|)''Vr/^- 
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The term {T{hj^ ® uy), hj,^ ® X{3V)'=) is in control by the full kernel representation 
and the Holder estimate for K. □ 

These are non-cancellative shifts so we must separately demonstrate the 
boundedness. For this, we prefer to write things in a different way: 

J2^{9, hj,)i)v{T{hi, ® uv), hi, ® hi, ® uv) 

V 

= ^{{9,hi,)i)v{{T*{hi., ® l),hi,)i,uv){{f, hi,)i,uv) 

V 

= C2-^^/2((/, hi,),, Y,{{9, hi,),)v{bi,i,,uv)uv^ 

V ' 
= C2-^^'/mf,hi,)„U,,^,^{{g,hi,),)) 

= C2-^^'^'{Ul^J{f,hi,),),{g,hM 

= C2~^^'/'{hi,^Ul^^^{{f,hi,),),g), 

where bi,i, = {T*{hi, (g) l),hi,)i/{C2^'^^/^) and ^bi,i, is the related paraproduct 
on defined by the general formula 

Ubtt = y~^(a)v(6, uv)uv. 

V 

4.3. Lemma. We have bi,i, G BMO(R"') with the hound 

\\bhI2 ||bMO(IR'") < c 

Proof. Let V be any cube in and a be any function in R"' such that spt a c V , 
|a| < 1 and / a = 0. It suffices to show that 



\h\ 






|l/2 




\K 





mhi, ® a), hi, ® 1)1 < 1^1- 

This is done by splitting 1 = xsv + X(3V')<: and using kernel estimates in a similar 
fashion as before. □ 



4.4. Remark. The strengthening of Lemma 14.21 to the related BMO estimate of 
Lemma l43l requires one to have the control C{uv,Xv) < C\V\ for V-adapted 
functions uy with zero-mean. It is precisely for these type of BMO reasons that 
merely the assumption C{xv,Xv) < C\V\ does not seem to be enough for the 
results of this paper. 

Let us abbreviate 

(«1,«2) 

E = E ■ 

luhCK h,hCK 

e{ii)=2-he(K),e{i2)=2-^2e(K) 
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We are ready to show the boundedness of our non-cancellative shifts of type 

(^1,^2, 0,0). 

4.5. Proposition. There holds 

(ii>«2) 
K h,hCK 

Proof. There holds by orthogonality that 



< 



K h,l2(ZK 



(*2) (ii) 

-EE||En.V,((/,Mi) 

K hCK hcK 
K hCK hcK 

Let be the orthogonal projection from L^(M") to span{/i/^ : Ii d K, — 
2~'i£(X)}. Write also fy{x) = f{x, y). There holds by the boundedness of para- 
products defined by BMO functions and the previous lemma that 



|n5VJ(/,/i/,)i||2< 



< 



1^1 


1/2 1 


h 


1/2 




K 





1^1 


1/2 1 


h 


1/2 




K 





\\{fM,)l\\2 

([ [ \pyy{x)\' dx dy) 



1/2 



Therefore, we have 

(«1,*2) 

II E E ^/.®n^.,.,((/'^/i)i^ 

K Ii,l2(LK 



1 / ^''^ 



1 1/2 



ih) 



\PKfy{x)? dxdy 



l/2\ 2 



(n) 



^E^(Ei^.i)(E/.„/i*/.wi^'*-*) 

V I I T-^K IiCK ^ 

Wkfvi^)? dxdy 



K 

^E 

K 



m J Tan 



WfyWldy^ 



2) 



where we again utilized orthogonality. 



□ 
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We end this this section by concluding that 

X] X] {'^J2)ji{T{hi^ ® ujj, hi^ (g) hi^ ® uj^){g, hj^ uj^) 

e(h)<e{i2) Ji£J2 

12 >i n>i2 

5. Separated /EQUAL 

There holds that 
Indeed, to see this, first estimate 

\{T{hi,®uv),hi,®uv)\<\V\-'^ J2 \{T{hi,®xv'),hi^®xv")\ 

V',V"&<±\{V) 
V'i^V" 

+ J] |(T(/i7,(g)Xy'),^/2®Xv')l]. 
y'ech(v) 

We have by the kernel representation in M" that 



mhi,®xv')M.®Xv')\ < C{xv',Xv' y ^' ' (^) 



< 11/ 

For y 7^ F" the estimate 





1/2 1 


h 


1/2 




is: 





1^1 


1/2 1 


h 


1/2 




K 





follows from the full kernel representation using the mixed Holder and size esti- 
mate of K. 

We may thus immediately write that 

^ ^{T{hi^ (g) uv)-, hi^ (g) uv)U-, hi^ g) uv){g-, hj^ g) uy) 

eih)<e{i2) V 

d(7l,72)>^(/l)T"^(/2)l-T" 

i2>i n>j2 

where in this case 5"i»20o are cancellative shifts. 
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6. Separated /NEARBY 

For the Ji and in the nearby summation it is evident that V — Ji V J2 satisfies 
i{V) < 2^^( Ji). Thus, we may write 

E E 

i(h)mh) e{Ji)<iiJ2) 

d{h,h)>e{h)-<nt{l2Y-'fi d{Jl,J2)<eiJlprne{J2y—lm 

JinJ2=0 

=EEEEEE E E 

j2>lil>i2 jl = l j2 = l K V d(/l,/2)>^(A)T"«(/2)l-T" d{Jl,J2)<e{Jl)^"^e{J2V—'"^,JinJ2=$ 

hVl2=K Jl\/J2=V 

e{h)=2-he(K),e{i2)=2-'2e{K) e{ji)=2-he{v),e{j2)=2-ne{v) 



It is easy to get the required estimate 



1^1 1 






|l/2 




\K 





by using the full kernel representation and the mixed Holder and size estimate 
of K. Therefore, we are able to realize this part in the form 



r n 



i2>l ii>i2 ii=l j2=l 



7. Inside /Inside 

We decompose 

{T{hi^ MjJ, h:^ uj^) = {T{hi^ (g) uj^), sj^j^ (g) sj^j^) 

+ (^J2)ji(^(^/i ® UjJ, Si^i^ (g) 1) 

+ {hj,)j,{T{hj,®ujjA®sj,j,) 
+ {hi2)ii{uj2)jAT{hh (^MJi), 1), 

where sj^i^ = Xii,{hi^ - {hi^)i^^) and sj^j^ = XjiMj^ - {'^J2)j2,i]- The relevant 
properties are spts/^/j C /21/ sptsj^jj C J21, Is/^/al < '^\h\~'^^'^ and [sj^jjI < 

7.1. Lemma. There holds 

Proof. In the case that < 2-H{l2) and £(Ji) < 2-H{J2) one may use the 
Holder estimate of K. In the case 2-H{l2) < ^{h){< ^{h)) and 2-'^£( J2) < ^( Ji)(< 
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£{J2)) one splits 

+ (T(/l/, ® MjJ, (X3/1S/2/2) ® (X(3Ji)-SJiJ2)) 

+ {T{hi, ® MjJ, (x(3/i)-s/2/2) ® {X3J1S.AJ2)) 

The first term is controlled by the size estimate of the full kernel: 

\{T{hi^ ® Mji), (X3/1S/2/2) ® (X3JiSjiJ2))| 

< |/ir^/^U2r^/^ / /" . I J 1-1/21 J 1-1/2 /" f dy2 





|l/2 


l^/ll 


|l/2 




|l/2 


l'/2| 


|l/2 



< r:^ < 



/i J3/i\/i Fi - yil" 7ji JajiVi F2 - 1/2 1 

/2|l/=^Vf(J2)y |J2|V2W(J2)/ ■ 



The two terms after that are controlled using the mixed size and Holder estimates 
of K. The last term is controlled using the Holder estimate of K. The mixed 
cases where 2-'e{l2) < i{Ii){< iih)) and £{Ji) < 2-'-£{J2) or e{h) < 2-H{l2) and 
2-H{J2) < £{Ji)i< KJ2)) are handled similarly. □ 

The above lemma shows that 
can be realized in the form 



00 00 



n=l ji=l 

The part 



/1C/2 J1CJ2 

can be written in the form 



where 



n=l 



t{h)=2-nt{K) 

and fe/^/^ = {T*{sij^ ® l), ^/i)i/C2"*i^/^. Since one can check ||&/ix||bmo(ir'") < 
c|/i|^/^/|A'|^/^, it is similarly as has already been done in the separated /inside 
case seen that ||5'*^°°°/||2 < ||/||2- The proof of the BMO estimate is similar to the 
proof of the previous lemma. 
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Completely analogously one can write 
in the form 

oo 

where ^"Ojio is a non-cancellative bounded shift. 
The last part 

X X {^h)h{'^J2)ji{T{hi, ® lijj, /i/i (8) hi^ (8) lijj) 

collapses to 

^{9)Kxv{T*l,hK®uv){f,hK®uv) = C(n^.i/f./,5), 

K,V 

where 

life/ = X(/)^x^(^' ® Uv)hK (8) My 
K,V 

is a bounded shift of the type (0, 0, 0, 0) for b in the product BMO of R" x W^. So 
here we can set S'oooo = n^,^^^^. Note that the correct normalization for this shift 
would follow just from the various kernel estimates and the weak boundedness 
property. 

7.2. Remark. In the proof of this representation theorem there are paraproducts 
of essentially three different types. We have seen two types already: the full 
paraproduct 

life/ = ^{f)Kxv{b, hx <8) Uv)hK <8) Uy 

K,V 

and some half paraproducts, like 

K hcK 

e{h)=2-h£{K) 

which have a paraproduct part only in the or R"* variable. The third type 
of paraproduct does not surface in our current sum, where < £{12) and 
^{Ji) < ^(<^2)- However, for example in the mixed case, where £{Ii) < £{12) and 
£{Ji) > £{J2), one has in the corresponding inside /inside part the mixed full 
paraproduct 

/ ^ X 1^ ^ ^l^'i^li'^)^ ® Uv){f, Hk ® Xv)XK ® Uy 
K,V 

= ^{Ti{l), Hk <8) uv){f, hx <8) Uy)hK ® uy, 

K,V 
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which is bounded as Ti(l) belongs to the product BMO of x by as- 
sumption. It is rather straightforward and well-known that both of these full 
paraproducts are bounded on if they are defined by functions in the dyadic 
product BMO. This can be proven by duality - see for example [,16il . 

8. Inside/equal 

One splits 

{T{hi^ ® uv), hi^ ® Uy) = {T{hj^ (g) -Uy), ® ""v) + {hh)h{Tihi-^ (g) uv), 1 ® My), 

where s/./^ = Xq,^{hi2 - {hi^)i^^) satisfies spts/^/^ C /^^ and js/./J < 2|/2|"^/^. 
One may write 

^ ^(r(/i/j (g) My), (g My)(/, /l/i (g My) (5', ® «v) 

in the form 

oo 

n=i 

with cancellative shifts. For this one needs that 



|(r(/l/j (g Uv), Si^i^ (g My) I 

Estimate 





1/2 




1/2 



y',y"ech(y) 
y^y" 

+ \{T{hi,®Xv'),si,i,®Xv'] 
y'Gch(y) 

In the case V ^ V" use the full kernel representation. In the diagonal case use 
the kernel representation in M". If < 2^'^£(l2), use the mixed size and Holder 
estimate of K (in the case V ^ V") or the Holder estimate for the kernel i^xy.xv 
(in the case V = V"). Inthe case 2-'"£(/2) < split s/,/^ = Xa/iS/i/a + XCs/i)^^/,/^. 
For V 7^ V" use the size estimate of K for the first term and the mixed size and 
Holder estimate of K for the second term. In the case V = V" use the size 
estimate of K^^,^^^, for the first term, and the Holder estimate of -ft'xi/'.xy/ 
second term. 
One writes 

X] ^^^h)h{T{hi^ (gMy),l (gMy)(/, /l/j ®Uv){g,hi^ ® Uy) 
hgh V 

in the form 

c(5°°°V,^7), 
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where in this case 

V 

and hv — {T*{1 ® uv),uv)2/C. This is indeed a non-cancellative shift of the type 
(0,0,0,0). 

8.1. Lemma. There holds ||bmo(]R") < c. 

Proof. Fix a cube K cW^ and a function a so that spta C K, \a\ < 1 and J a = 0. 
We need to show that | (T(a (g) uy), 1 (8 ^4^) | < We begin with the split 

(T(a (g) My), 1 (g) Uv) = {T{a ® Uv),XK ® Uy) + (T(a®My), X3X\A- ® My) 

+ (T(a ® uv),X{3Kr ® 

There holds 

{T{ai^uv),X3K\K<^uv) <\V\~^ ^ |(7'(a<H)Xy')>X3K\K<E)Xy")| 

y',y"ech(y) 

y'ech(y) 

where 

I {T{a ® xy) >X3x\^ ® Xy) I 

< /" /" \, , dx,dy,- [ [ ^—^dx,dy2<\K\\V\ 



K J3K\K 1^1 ~ yi\ Jv JV" 1^2 — Z/2 



and 

'(a®Xy'),X3i^\i^ ®Xy')l < C'fxy^Xy) , , 

IkJ3k\k \xi-yi 



\{T{a^Xv'),X3K\K^Xv')\ <C{xv',Xv') I I i — n:dxidyi<\K 




Furthermore, we have 



\{T{a(g)Uv),Xi3Kr(S)Uv)\<\V\'^ ^ \{T{a (g) xv'):X{3Kr ® Xv")\ 



y',y"ech(y) 

V'ytV" 



+ ^ \{T{a^xv'),X{3K)c®Xv' 
y'ech(y) 



where 



\{T(a®Xv'),X(3K)'=®Xv" 

'{3KY Fl ~ CKy'^" Jv' Jv" 1^2 - y2 



<\K\-iiKy I TZr^^-l I TZr^.d^2dy,<\K\\V\ 
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and 

f f £(KY 

\{T{a(^Xv'),X{3Ky^Xv')\ <C{xv',Xv') / i [zri dxidyi < \K 

For the first term we again begin with the estimate 

\{T{a®uv),XK^uv)\<\V\-' \{T{a(^xv'),XK®Xv")\. 

V',V"ech{V) 

Let us consider the case V ^ V". In this case we have 



\{T{a(S)Xv'),XK (S>XV")\= / / Ka^^i,{x2,y2) dx2dy2 

Jv Jv" 

<C{a,XK) I I ] ^ — —dx2dy2<\K 

Jv JV" F2 — 1/2 1 

Thus, we are only left with the need for the estimate \ {T{a® Xv)-, Xk ® Xv')\ ^ 
li^'l |\^| - but this is one of the diagonal BMO assumptions. 

□ 

Because of this lemma, one can show, similarly but with a bit less effort than in 
Proposition 1431 that 5°°°° is bounded. 

9. Inside/nearby 

This goes very much so in the same vein as the inside/ equal case. In fact, this 
is easier since the nearby cubes do not intersect by definition. From the series 
with the matrix element {T{hj^ ® mjJ, sj^j^ ® uj^) we get 

oo r ji 

*^ X! X! £ 2-*i^/22-ii'5/2^^nOiij2 gy 
n=i ii=i j2=i 

From the series with the matrix element (hi^)i^ {T{hi^ ® mjJ, 1 ® uj^) we get 

ii=i i2=i 

with bounded non-cancellative shifts. 

10. Equal/equal 

This part can be realized in the form (7(5*°°"°/, g) for a cancellative shift, since 
one can just estimate | {T{hK ® uy), hx ® uy) \ < 1. This estimate is an easy conse- 
quence of the weak boundedness property and the size estimates of our kernels. 
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11. Equal /nearby 
This part is clearly of the form 

jl = l j2 = l 

where the shifts are cancellative. Here one can again just use the estimate | {T{hK® 
ujj, hx uj^) I < 1, which follows just from the size estimates of our kernels. 

12. Nearby/ Nearby 

This part is of the form 

r i\ r ji 

n=i 12=1 ji=i j2=i 

once again because of the easy estimate | {T{hi^ ®uj-^),hi^®uj^) \ < 1. This follows 
from the size estimate for the full kernel. 
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